On a correlational clustering of integers
نویسندگان
چکیده
Correlation clustering is a concept of machine learning. It was introduced in Bansal et al. [2], which gives a good overview of the mathematical background as well. Let G be a complete graph on n vertices and label its edges with + or − depending on whether the endpoints have been deemed to be similar or different. Consider a partition of the vertices. Two edges are in conflict with respect to the partition if they belong to the same class, but are different, or they belong to different classes although they are similar. The ultimate goal of correlation clustering is to find a partition with minimal number of conflicts. The special feature of this clustering is that the number of clusters is not specified. A typical application of correlation clustering is the classification of unknown topics of (scientific) papers. In this case the papers represent the nodes and two papers are considered to be similar, if one of them cite the other. The classes of an optimal clustering is then interpreted as the topics of the papers. The number of partitions of n vertices grows exponentially, hence there is no chance for exhaustive search at computer experiments. Bansal et al. [2] proved that to find an optimal clustering is NPhard. They also presented and analyzed algorithms for approximate solutions of the problem. The correlation clustering can be treated as an optimization problem: minimize the number of conflicts by moving the elements between clusters. Hence one can use the traditional and new optimization algorithms to find near optimal partition. Bakó and Aszalós [1] have implemented the traditional methods and invented some new ones.
منابع مشابه
On a sequence related to the coprime integers
The asymptotic behaviour of the sequence with general term $P_n=(varphi(1)+varphi(2)+cdots+varphi(n))/(1+2+cdots+n)$, is studied which appears in the studying of coprime integers, and an explicit bound for the difference $P_n-6/pi^2$ is found.
متن کاملEEH: AGGH-like public key cryptosystem over the eisenstein integers using polynomial representations
GGH class of public-key cryptosystems relies on computational problems based on the closest vector problem (CVP) in lattices for their security. The subject of lattice based cryptography is very active and there have recently been new ideas that revolutionized the field. We present EEH, a GGH-Like public key cryptosystem based on the Eisenstein integers Z [ζ3] where ζ3 is a primitive...
متن کاملOn the Diophantine Equation x^6+ky^3=z^6+kw^3
Given the positive integers m,n, solving the well known symmetric Diophantine equation xm+kyn=zm+kwn, where k is a rational number, is a challenge. By computer calculations, we show that for all integers k from 1 to 500, the Diophantine equation x6+ky3=z6+kw3 has infinitely many nontrivial (y≠w) rational solutions. Clearly, the same result holds for positive integers k whose cube-free part is n...
متن کاملA Note on Artinianess of Certain Generalized
Let ?: R0?R be a ring homomorphism and suppose that a and a0, respectively, are ideals of R and R0 such that is an Artinian ring. Let M and N be two finitely generated R-modules and suppose that (R0,m0) is a local ring. In this note we prove that the R-modules and are Artinian for all integers i and j, whenever and . Also we will show that if a is principal, then the R-modules and ...
متن کاملThe Graph Clustering Problem has a
The input to the Graph Clustering Problem consists of a sequence of integers m 1 ; :::; m t and a sequence of P t i=1 m i graphs. The question is whether the equivalence classes, under the graph isomorphism relation, of the input graphs have sizes which match the input sequence of integers. In this note we show that this problem has a (perfect) zero-knowledge interactive proof system.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1404.0904 شماره
صفحات -
تاریخ انتشار 2014